|
|
|
| Home | Wayback Machine | Archive-It | Blog | Heritrix |
| Anonymous User (login or join us) |
Internet Archive crawldata from Aaron Swartz Crawl, captured by crawl345.us.archive.org:aaronswartz from Mon Jan 21 08:16:03 PST 2013 to Mon Jan 21 00:49:07 PST 2013.
This item is part of the collection: Away from Keyboard: Aaron H. Swartz
Identifier: AS-20130121081603-crawl345
Contributor: Internet Archive
Crawljob: aaronswartz
Creator: Internet Archive
Date: 2013
Firstfiledate: 20130121081633
Firstfileserial: 02364
Identifier-access: http://www.archive.org/details/AS-20130121081603-crawl345
Lastdate: 20130121004907
Lastfiledate: 20130121084640
Lastfileserial: 02373
Mediatype: web
Numwarcs: 10
Operator: lekash@archive.org
Scandate: 20130121081633
Scanner: crawl345.us.archive.org
Scanningcenter: sanfrancisco
Sizehint: 10220722505
Sponsor: Internet Archive
Publicdate: 2013-01-21 21:25:29
Addeddate: 2013-01-21 21:25:29
Imagecount: 218812
Keywords: crawldata
| Information | Format | Size |
| AS-20130121081603-crawl345_files.xml | Metadata | [file] |
| AS-20130121081603-crawl345_meta.xml | Metadata | 1.4 KB |
| Other Files | Web ARChive GZ | WARC CDX Index | Item CDX Index | Item CDX Meta-Index | Text |
| AS-20130121081603-02364.warc.gz |
1.1 GB
|
728.5 KB
|
|||
| AS-20130121081603-crawl345.cdx.gz |
12.3 MB
|
||||
| AS-20130121081603-crawl345.cdx.idx |
8.6 KB
|
||||
| AS-20130121081913-02365.warc.gz |
967.1 MB
|
1.6 MB
|
|||
| AS-20130121082300-02366.warc.gz |
954.5 MB
|
1.4 MB
|
|||
| AS-20130121082609-02367.warc.gz |
953.9 MB
|
1.7 MB
|
|||
| AS-20130121082847-02368.warc.gz |
964.5 MB
|
1.3 MB
|
|||
| AS-20130121083150-02369.warc.gz |
953.7 MB
|
1.7 MB
|
|||
| AS-20130121083525-02370.warc.gz |
953.7 MB
|
1.3 MB
|
|||
| AS-20130121083917-02371.warc.gz |
953.7 MB
|
1.1 MB
|
|||
| AS-20130121084318-02372.warc.gz |
976.4 MB
|
951.2 KB
|
|||
| AS-20130121084606-02373.warc.gz |
953.7 MB
|
1.4 MB
|
|||
| MANIFEST.txt |
660.0 B
|